A fast and resource-conscious MPI message queue mechanism for large-scale jobs
نویسندگان
چکیده
The Message Passing Interface (MPI) message queues have been shown to grow proportionately to the job size for many applications. With such a behaviour and knowing that message queues are used very frequently, ensuring fast queue operations at large scales is of paramount importance in the current and the upcoming exascale computing eras. Scalability, however, is two-fold. With the growing processor core density per node, and the expected smaller memory density per core at larger scales, a queue mechanism that is blind on memory requirements poses another scalability issue even if it solves the speed of operation problem. In this work we propose a multidimensional queue management mechanism whose operation time and memory overhead grow sub-linearly with the job size. We show why a novel approach is justified in spite of the existence of well-known and fast data structures such as binary search trees. We compare our proposal with a linked list-based approach which is not scalable in terms of speed of operation, and with an array-based method which is not scalable in terms of memory consumption. Our proposed multidimensional approach yields queue operation time speedups that translate to up to 4-fold execution time improvement over the linked list design for the applications studied in this work. It also shows a consistent lower memory footprint compared to the array-based design. Finally, compared to the linked list-based queue, our proposed design yields cache miss rate improvements which are on average on par with the array-based design. Keywords— MPI, Message Queues, Multidimensional Searches, Scalability, Exascale
منابع مشابه
MPI Debugging with Handle Introspection
The Message Passing Interface, MPI, is the standard programming model for high performance computing clusters. However, debugging applications on large scale clusters is difficult. The widely used Message Queue Dumping interface enables inspection of message queue state but there is no general interface for extracting information from MPI objects such as communicators. A developer can debug the...
متن کاملArchitecting a Large-scale Elastic Environment - Recontextualization and Adaptive Cloud Services for Scientific Computing
Infrastructure-as-a-service (IaaS) clouds, such as Amazon EC2, offer pay-for-use virtual resources ondemand. This allows users to outsource computation and storage when needed and create elastic computing environments that adapt to changing demand. However, existing services, such as cluster resource managers (e.g. Torque), do not include support for elastic environments. Furthermore, no recont...
متن کاملAn Overview of MPI Characteristics of Exascale Proxy Applications
The scale of applications and computing systems is tremendously increasing and needs to increase even more to realize exascale systems. As the number of nodes keeps growing, communication has become key to high performance. The Message Passing Interface (MPI) has evolved to the de facto standard for inter-node data transfers. Consequently, MPI is well suited to serve as proxy for an analysis of...
متن کاملSmall clusters - MPI usage for solving MHD equations with the FLASH code
Despite fast development of computer technology some numerical problems are still too complex to be solved using personal computers. In such cases the use of supercomputers can be a good choice. However, running jobs at computer centers involves necessity of queuing. Additionally, problems with fast transmission may arise if output files are large enough. An alternative solution is to build mul...
متن کاملHierarchical Optimization for Fast Resource Allocation
Particular problem in the management of distributed systems is the optimal resource allocation among the subsystems. When several subsystems of the distributed system use or compete for the use of a common resource, the limited capacity of the resource can give rise to constraint system behavior. Respectively, the fast solution of the resource problems strongly influences the application area o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Future Generation Comp. Syst.
دوره 30 شماره
صفحات -
تاریخ انتشار 2014